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We present measurements of the linear diameter of the emission region of the 
Vela pulsar at observing wavelength A = 18 cm. We infer the diameter as a func- 
tion of pulse phase from the distribution of visibility on the Mopra-Tidbinbilla 
baseline. As we demonstrate, in the presence of strong scintillation, finite size of 
the emission region produces a characteristic W-shaped signature in the projec- 
tion of the visibility distribution onto the real axis. This modification involves 
heightened probability density near the mean amplitude, decreased probability 
to either side, and a return to the zero-size distribution beyond. We observe 
this signature with high statistical significance, as compared with the best-fitting 
zero-size model, in many regions of pulse phase. We find that the equivalent 
full width at half maximum of the pulsar's emission region decreases from more 
than 400 km early in the pulse to near zero at the peak of the pulse, and then in- 
creases again to approximately 800 km near the trailing edge. We discuss possible 
systematic effects, and compare our work with previous results. 

Subject headings: methods: data analysis - techniques - stars: pulsars - pulsars: 
individual: Vela pulsar - interstellar scattering 



1. INTRODUCTION 

Pulsars emit strong radio emission from compact regions. The enormous magnetic fields 
and rapid rotation of neutron stars easily accelerate electrons and positrons to high energy, 
but the means by which a small fraction of that energy is transformed to radio emission 
remains poorly understood. Pulsar emission regions are small, but interstellar scattering 
of radio waves provides an astronomical-unit-scale lens with the nanoarcecond resolution 
sufficient to resolve them spatially. However, the lens is highly corrupt, and unraveling 
source structure involves application of statistical models to large volumes of high-quality 
data. These models must include accurate descriptions of the effects of scattering and of 
noise. 

In this paper, we describe observations and data analysis to fit a simple model for 
a spatially-extended emission region to the interferometric visibility statistics of the Vela 
pulsar. We focus on description of the models, the data, and comparisons of the two. In 
this introductory section, we briefly describe, as background, the features of interstellar 
scattering important for our technique and the basic features of pulsar physics involved in 
the interpretation. 
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1.1. Pulsar Emission Physics 



Although pulsars have been observed for more than 40 years, the process by which a 
rapidly-rotating, magnetized neutron star converts a small fraction of its rotational energy 
to radio waves remains unclear. The rapid rotation would produce v x B forces and induced 
electric fields ample to tear electrons from the surface of the neutron star, were those forces 
not cancelled by an induced co- rotating charge distribution (jGoldreich fc Julian! 1 196 9 ) . The 
magnetic field of the neutron star away from its s urface is nearly d ipole, modified by inertia 
of the corotating particles and relativistic effects (jSpitkovskyl 120061 ) . "Open" field lines pass 
through the light cylinder, the surface where the co- rotation speed is that of light. These 
field lines carry highly-relativistic charged particles away from the star, forming a powerful 
wind. A small fraction of the energy of this wind is apparently converted to radio emission, 
observed as pulses because of stellar rotation. 

The boundaries of the set of open field lines form promising places for the origin of 
pulsar emission. Above the "polar cap" at the base of the open field lines, the current 
may be insufficient to replace the outflowing charge, so tha t a gap may form, with strong 



electric field parallel to the nearly-vertical magnetic field (IRuderman fc Sutherland! 11975 



Arons fc Scharlemannlll979l ). Similarly, a gap may form between the last closed field lines, 
which nearly graze the light cylinder, and the open field lines. Proposed locations include 
a "slot gap" extending to high altitude from the polar c ap (Muslimov fe Harding] 2004 ) 



and charge-free "outer gaps" in th e outer magnetosphere (jCheng. Ho. fc Ruderman 



1986 



Chang. Ruderman. &: Zhang! 120001 ). Within these gaps, electrons and positrons acceler- 
ate to TeV energies, accompanied by pair creation. These particles can emit X-rays and 
gamma-rays via curvature, synchrotron, and inverse Compton emission. Force-free simula- 
tions of pulsar magnetospheres cannot include gaps, but interestingly predict strong currents 



( Spitkovskv 


2006; 


Gruzinov 


2007) 



power source for radio emission. 

The radio emission is coherent: the o bserved ~ 10 26 K brightne ss temperature ex 



ceeds that possible for individual electrons ([Manchester fc Taylor! 1 19771 ). The nanosecond 
yariability of giant p ulses indicates that the emission originates in structures ~ 1 m across 
(IHankins et al.ll2003l ). These structures are likely distributed over scales comparable to those 
of the particle-acceleration zones: larger than the polar cap, ~ 1 km, but smaller than the 
diameter of the light-cylinder. In this work, we seek to determine the lateral dimension of 
this region of emission. 

Emission could arise directly from particles traveling along the field lines, in which case 
polarization and temporal variations would directly reflect conditions at the source. Al- 
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ternatively, emission may be reprocessed before it l eaves the pulsar's magne t osphere, as is 



suggested by observations and theoretical models (jLyutikov fc Parikhl |2000| ; iJessner et al. 



2010l ). Radio emission could also arise via production of plasma waves from plasma instabil- 



ities on open field lines. These plasma waves would propagate nearly along field lines, and 
then be co nverted to radio waves w here the local plasma frequency falls below the observed 
frequency (IBarnard fc Arond 119861 ) . Our measurements can contribute to this picture by 
describing the lateral scale of the emission region. 



1.2. Interstellar Scattering and Scintillation 



Radio waves emitted from a pulsar encounter variations in refractive index in the in- 
terstellar medium, from variations in electron density. Waves travel onward with "crinkled" 
phase fronts and arrive at an observer from along a number of paths to form a diffraction 
pattern. For the interstellar medium at decimeter or longer wavelengths, the differences in 
path lengths are many wavelengths, and many paths contribute to the diffraction pattern at 
the observer (I Cohen fc Cronynl|l974j ). 



The diffraction pattern at the observer is the convolution of an im age of the source 
with the pattern for a point source ( jCornwell et al.lll989l : lGoodmanlll996l ) . This results from 
the fact that, for small deflections, the Kirchoff integrals that relate the electric field at the 
observer to those at the source become Fourier tr ansforms, with the effects of the scattering 
medium inserted multiplicatively at the screen (IGwinn et al.lll998l ). Geometrical factors, 
depending on the position of the screen, relate the original source and corrupt image by a 
magnification factor M = D/R, where D is the distance from observer to scatterer, and R 
is the distance from scatterer to source. 

Because path differences are thousands of wavelengths, paths that reinforce at one 
observing frequency and position in the observer plane may cancel at nearby frequencies or 
positions. For an observed angular extent #iss of the scattering disk and observing wavelength 
A, the scale of the diffraction pattern at the observer is Siss = V^iss> equal to the linear 
resolution of the scattering disk viewed as a lens. The source shows scintillations, or intensity 
variations, with timescale Atiss = Siss/Kl as the line of sight sweeps through scattering 
material at speed V±. A sharp pulse at the pulsar will arrive over a range of times r ISS at the 
observer; the uncertainty principle relates Tigs to the bandwidth of the scintillations Ai/jss = 
l/(27TTiss)- Over each element of the scintillation pattern Auiss x Atiss, the propagation 
changes the electric field by a complex gain: a random amplitude and phase. At the source, 
the characteristic scale is MSiss', a displacement of the source by this length has an effect that 
is statistically equivalent to displacement of the observer by Siss- The above basic parameters 
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define the arena of our "interstellar telescope": the high resolution of the scattering disk, 
acting as a lens, produces the statistical structure of the diffraction pattern in the observer 
plane, which is then modified by the structure of the source. 

The Vela pulsar is particularly attractive for statistical studies of interstellar scattering 
because its scintillation bandwidth is relatively narrow at decimeter observing wavelengths, 
so that many samples can be accumulated quickly. The pulsar is strong, so the signal-to- 
noise ratio within one scintillation element is high. The pulsar is scattered enough that 
the diameter of the scattering disk can be measured with Earth-based interferometry; this 
measurement allows us to determine the location of the scattering screen along the line of 



"1-1— 1 

( Gwinn 


2001; 


Shishov 


2010) 



1.3. Pulsar Emission Structure via Interstellar Scattering 



A variety of authors have reported investigations of pulsar size using interstellar scat- 
tering. The observables address either motion of the emission centroid, or the size of the 
emission region. The present work falls into the second category. 

Measurements of, or upper limits on, motion of the centroid of emission rely upon 
the reflex shift of the scintillation pattern in the observer plane, when the pulsar rotates 
(ICordes. Boriakoff. fc Weisberg||l983t IWolszczan fc Cordeall9871 ; ISmirnova. Shishov. fc Malofeev 



1996 



Gupta. Bhat. fc Rao 



19961 ) . The observations measure such shifts from correlation of 



the scintillation spectrum at different phases of the pulse, with spectra at later or earlier 
times. Because motion of the source dominates changes in the scintillation pattern at the 
observer plane over these timescales, such a correlation, with information on the location of 
the screen, yields the shift of the source. These observations commonly find scales ranging 
from a few hundred to a few thousand km, or up to the diameter of the light cylinder. 

The phrase "stars twinkle, planets do not" expresses the fact that a finite source emis- 
sion region can decrease the modulation by scintillation. F rom the depth of modulation of 



scinti llation, one can infer the size of the emission region ( iCohen et al.l Il966l ; iGwinn et al. 



19981 ). This technique was used before the advent of synthesis interferometry to measure 



source sizes from scintillation in the int erplanetary medium, ( iReadhead fc Hewishl 11972 



Hewish. Readhead. fc Duffet-Smithlll974j ). In "s trong" scintillation, the modulation index 
m = a/ (I 2 )/ (I) 2 — 1 of a point source is 100% ( Cohen fc Cronynl 1 19741 ); for most pulsars 
at decimeter wavelengths, scintillation is very strong. Thus, modul ation of less than 100% 
would suggest the presence of source structure, on the scale MSiss- Macquart et al.l (120001 ) 
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report an upper limit on the size of the Vela pulsar at A = 45 cm observing wavelength, 
from the modulation index, although they do not discuss effects of intrinsic variability of the 
pulsar or self-noise; these can also affect the modulation index. 

We modify earlier approaches to measuring, or setting limits on, the size of the source 
from modulation, in that we fit a model to the distribution of intensity (or interferometric 
visibility). Source size affects the smallest intensities or visibilities most strongly. Intrinsic 
intensity variations and self-noise affect the largest intensities most strongly, as we discuss in 
Section 14.2.21 All of these effects change the distribution function of intensity and visibility, 
and thus their moments. 

The modulation index is a combination of first and second moments of intensity, and 
thus is most sensitive to the largest intensities. Thus, it is least sensitive to the shape of 
the distribution function where effects of source size are largest, and most sensitive where 
they are smallest. The full distributions of intensity or visibility provide more sensitive and 
complete information. They can distinguish among effects that alter these distributions in 
different ways. 



1.4. Distributions of Electric Field, Intensity, and Visibility 

Scintillation affect s the electric field of an a strophysical source at a particular frequency 
by a gain and a phase (IGwinn fc Johnson! 1201 ll ). here combined into the complex "scintilla- 
tion gain" g. For a point source in strong scattering, g is drawn from a circular Gaussian 
distribution in the complex plane, with zero mean. This behavior is a consequence of the 
large differences of path lengths, and the fact that many paths contribute to the signal 
received at the observer; the Central Limit theorem then implies, under rather general as- 
sumptions, that the scintillation gain resulting from that sum over paths is drawn from a 
Gaussian distribution. This result is independent of assumptions about the distribution of 
scattering material; for example, scattering in an extended or inhomogen eous medium can 
be treated quite generally via path integrals, and the same result holds (IFlattel Il979f ) . In 
the time domain, the received signal is the emitted signal convolved wi th a kernel q that 
descr ibes the scattering medium; g and g form a Fourier transform pair ( IGwinn fc Johnson 
201 lh . 



Because the scintillation gains are draws from a complex Gaussian distribution, their 
square modulus is drawn from an exponential distribution. An exponential distribution is 



(Scheuer 


1968; 


Gwinn et al. 


1998) 
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antennas are drawn from correlated, complex Gaussian distributions. Their product, the 
interferometric visibility, is drawn from the distribution of the product of su ch quantities . 
This distribution is a zero-order modified Bessel function times an exponential (lGwinnll200ll ). 
In practice, to obtain these distributions the observer must average the intensity, or the 
interferometric visibility, over many samples of the random electric field of the source. This 
field is itself noiselike, and contributes to noise in the measurement via self-noise, as we 
discuss below. 

If the source is extended, different points on the source will have different scintillation 
gains. These gains decorrelate as the separation between points on the source plane increases. 
A source consisting of two separated point sources provides a simple example. If the two 
parts are separated by much less than MSiss, then the gain factors are identical and the 
result is that for a point source above; if the separation is much greater than MSiss, then 
the observer records two superposed, independent scintillation patterns. If the sources are 
incoherent, then the observed intensity is the sum of the two; and the distribution of observed 
intensity is the convolution of two exponential distributions. The analogous results hold for 
interferometric visibility. For a small, but extended source, the distributions of gains and 
phases for the different parts of the source are correlated; however, they can be expressed as 
the co nvolution of th e original distribution with distributions of the same form, but of smaller 
scales ( lGwinnll200lf ). As discussed in more detail below, finite size tends to concentrate the 
distribution of intensity near the mean, and to soften the sharp cusp of the point-source 
distribution. 

Noise and intrinsic variations of flux density both broaden the observed distributions of 
intensity and interferometric visibility. Noise includes contributions both from backgrounds 
and from the noiselike source. Backgrounds are nearly indep endent of the flux density 



of th e source, with small corrections for quantization effects (IGwinnl 120061 ; iGwinn et al. 



20121 ). Source noise has standard deviation proportional to the flux densit y of the source 



it is termed "heteroscedastic," indicating that the variance is not constant ( jOslowski et al. 



201ll ). In combination, these contributions lead to variance of the noise given by a quadratic 
polynomial in phase with the signal, and the linear terms of that polynomial at quadrature 
( IGwinn et al.l l201ll 120121 ) . Intrinsic variation s of flux density can be divided into 3 regimes 
according to time scale (IGwinn et al.ll201lf ). The time to accumulate one sample of the 
spectrum is the product of the sampling rate and the number of spectral channels, termed the 
"accumulation time". Variations shorter than the accumulation time introduce correlations 



but do not change the noise (IGwinn fc Johnson! 1201 if ). Intermediate-term variations, longer 
than the accumulation time but shorter than the integration time, contribute to noise. Long- 
term variations, longer than the integration time, lead to a superposition of distributions with 
different mean flux density. Both noise and amplitude variations thus act in characteristic 
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ways the can be distinguished from effects of size; moreover, both broaden the distribution 
rather than narrowing it, as does finite size. 

1.5. Outline of Paper 

This paper focuses on our data and technique to estimate the size of the Vela pulsar's 
emission region from the distribution of interferometric visibility. This measurement requires 
data with stationary instrumental gain, well-characterized noise, and rapidly-sampled, gated 
correlation. Our analysis involves models for the effects of scintillation, noise, and amplitude 
variations of the pulsar. 

In Section [2j we describe our observations of the Vela pulsar, correlation and gating 
using the DRAO VLBI correlator, calibration, and fringing. We then present typical data 
and describe the formation of our data histograms. 

Next, in Section [3J we outline our analysis. This analysis involves calculation of model 
histograms and fits to data. We calculate the distribution of interferometric visibility in the 
complex plane, for a small, circular source, in Section 13.21 We describe the distribution of 
noise in Section 13.3} and demonstrate how a superposition of distributions can model the 
pulsar amplitude variations in Section 13.41 We discuss the combination of these effects and 
evaluation of the model in Section 13.51 In particular, we demonstrate that the signature 
of finite source size is a W-shaped difference of the best-fitting finite-size model from the 
best-fitting zero-size model. We detail the numerical evaluation techniques in Section 13.61 
and fitting techniques in Section 13.71 

We present our results in Section HI We first discuss an example fit in detail and show 
that the data histogram displays the characteristic W-shaped signature of source size. We 
then give our results for all gates and spectral ranges. We discuss various systematic effects 
that can contribute to, or bias, the inferred source size. We briefly compare our results with 
previous results at A = 13 cm, and compare with other observational studies of the Vela 
pulsar's emission region. 

In Section [5j we summarize our results. 
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2. OBSERVATIONS, CORRELATION, AND CALIBRATION 

2.1. Observations 

We observed the Vela pulsar on 10 Dec 1997 using a network comprising antennas at 
Tidbinbilla, Mopra, Hartebeesthoek, and the VSOP spacecraft. The observations began at 
14:15 UT and ended at 22:40 UT, for a time span of 8:25. The observations were made at 18 
cm observing wavelength, with left-circular polarization. We recorded two frequency bands 
(IFs), of 16 MHz each, at each antenna. The bands spanned 1634 to 1650 MHz (IF1) and 
1650 to 1666 MHz (IF2). Data were digitized (quantized and sampled) at recording time. 



2.2. Correlation 



The data were correlated with the Canadian S2 VLB correlator ( ICarlson et al.lll999l ). 
This correlator is a reduced-table 4-level correlator. Each IF was correlated separately with 
8192 lags to form a cross-correlation function. The correlator was gated synchronously with 
the pulsar pulse, in 6 gates across the pulse. Each gate was 1 msec wide. The first 5 gates 
covered the pulse, as shown in Figure [TJ The sixth gate was located far from the pulse, 
when the pulsar is "off". We averaged the results of the correlation for 2 sec, or 22.4 pulsar 
periods; except on the baselines to the spacecraft, which we averaged for 0.5 sec, or 5.6 pulsar 
periods. 



2.3. Editing and Fringing 

2.3.1. Editing 

The data were recorded in single sidebands; thus, the spectra contained 8192 chan- 
nels, each with bandwidth 1.95 kHz. The cross-power spectra are complex. The phase 
i ncludes instrumental effects, primar ily observational and instru mental delays an d rates 
( Thompson. Moran. fc Swensonl fl986"l ) : and effects of scintillation ( Desai et al. 1992 ). 



We found that the time period from 19:10 to 21:13 UT on the Mopra-Tidbinbilla base- 
line contained data with uniform high gain and low noise; absence of interference or gaps 
in correlation; and little change in length and orientation of the baseline. The coordi- 
nates projected perpendicular to the source direction vary over the range of (u,v,w) from 
(422.8, 1416.9, -392.67) to (733.5, 1193.7, -614.5) /xs. The projected baseline length is ap- 
proximately 428 km. 
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We edited the data to remove times and channels with interference or corrupt recording. 
We identified channels that showed evidence of interference such as high amplitude or high 
noise. This amounted to 27 channels for IF1 and 12 channels for IF2. We also identified 
time records with excessively low or high amplitudes, or with low correlation amplitudes, 
and removed those. 



2. 3. 2. Fringing 



We corrected for average delay and rate by fringe- fitting (see lThompson. Moran. fc Swenson 



19861 ). Delay represents a phase slope with frequency, and rate a phase slope with time. We 
fringed the central 7168 channels of Gate 2, leaving "guard zones" 512 channels wide on 
each end, for the regions where passband gain rolled off and instrumental phase varied most 
rapidly. We formed a 2-dimensional discrete Fourier transform to find t he fringe rate for each 



8-sam ple (16-sec) time interval, using the traditional "fringe" algorithm ([Thompson. Moran. fc Swenson 



1986|). 



We then used the fringe rate and delay from Gate 2 to remove the corresponding phase 
slopes from the other "slaved" pulsar gates. In tests with pairs of gates that contained 
strong signal, we verified that delay and rate were the same for all gates, to the accuracy 
permitted by signal-to-noise ratio. However, we found that the residual interferometer phase 
depended on pulsar gate. We therefore used the phase estimated from each gate to correct 
that gate. For this paper, the primary purpose of fringe-fitting was to remove instrumental 
phase, leaving only the effects of scintillation and those of statistical noise in the data. 



2.3.3. Dynamic Cross-power Spectra 



The calibrated data take the form of complex cross-power spectra, sampled as a function 
of time, in 6 pulse gates and 2 IFs. These data were gathered for several baselines, as 
discussed in iGwinn et al.l ( 120121 ). In this paper, we focus on the relatively short Mopra- 
Tidbinbilla baseline. As an example, Figure [2] shows the real part of the visibility, for a 
short span of time and frequency in 3 gates. The relative variation in amplitude between 
the gates has been removed by calibration, using the average amplitude over the spectral 
range for the time span of all the data, as shown in Figure [TJ The ratio of calibration factors 
was 24 : 36 : 13 for gates 1, 2, and 3. The observed spectra differ because of the effects 
of noise, and because of variations in the relative amplitudes of individua l pulses at the 



different pulse phases (IKrishnamohan fc Downs! 11983 



Johnston et al. 2001; Kramer et al. 
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20021 ; iGwinn et al.ll2012t I Johnson fc Gwinnll2012l ). The scattering medium is expected not 



to change between gates, which after all integrate over the same time interval; thus, the 
scattering pattern should be the same, allowing for variations in noise and pulse-to-pulse 
variability. An interesting question is whether differences might additionally reflect changes 
in the structure of the pulsar's emission region between gates. Because the effects of noise 
and amplitude variations are random, this issue can only be treated statistically, using the 
correct descriptions of noise and amplitude variation. Complicating this comparison is the 
fact that effects of source size are largest when scintillation leads to small flux densities 
( IGwinn et al.lll998l ). The remainder of this paper makes such a statistical comparison. 



2.3.4- Histograms 

We reduce the observational data to a histogram of measurements of the real part Vn, 
and a histogram weighted by the mean square imaginary part Qn- The subscript "N" de- 
notes that these distributions reflect measured histograms rather than model distributions. 
Mathematically, these histograms correspond to sums over the observed points V{y, t), re- 
stricted to bins of width w about the bin centers, at real part X k : 

V N (X k ) = 1 for X k - w/2 < Re[V(v, £)] < X k + w/2 (1) 

v,t 

Q N {X k ) = lm[V(u, t)f for X k - w/2 < Re[V(v, t)} < X k + w/2. 



v,t 



We sought to make the histograms from sufficiently narrow spectral ranges so that the am- 
plitude did not vary greatly across the spectrum, while including enough points for robust 
statistics. Most of the spectral va riation arises f rom t he shape of the pulse and pulse disper- 



sion, as Figured], and Figure 1 of IGwinn et al.l (120121 ). suggest. We adopted spectral ranges 



of 1024 channels within each 8192-channel spectrum from each gate. We dropped the first 
and the last 1024 channels, to avoid effects of gain rolloff and phase variations near the edges 
of the observed band. Consequently, our data are indexed by IF number (1 or 2), by gate (1 
through 5), and by channel range in increments of 1024, beginning at 1024 to 6144. Figure 
[T] shows the centroids of the 1024-channel ranges for Gate 2. 

Figure [3] shows examples of the measured histograms, for IF1, gate 1, and channels 
4096-5120. We use these distributions, and others like them for other spectral and gate 
ranges, to fit for the parameters of our theoretical models, as discussed in Section We 
used intervals of w = 0.0002 for the histograms in this paper. Wider bins would average 
over the smooth, rapid variations of the distribution among bins. 
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Projection has a number of advantages. One-dimensional distributions are much easier 
to visualize and fit than 2-dimensional distributions. Projection increases the number of 
samples per cell, reducing Poisson noise. For the short Mopra-Tidbinbilla baseline, amplitude 
variations from scintillation affect primarily the real part, broadening it along the real axis. 
Phase variation from scintillation affects the imaginary part, and noise affects both real and 
imaginary parts. Consequently, the second moment of the imaginary part in Q provides 
a useful constraint on noise and the effects of finite baseline length. For a short baseline, 
visibility is concentrated near the real axis, so most of the information is contained in these 
two distributions. 



3. ANALYSIS 



3.1. Overall Strategy 



We seek to compare the observed distribution of interferometric visibility, as expressed 
by the projected histograms, with theoretical m odels. These models include the expected 
distribution of visibility fo r a scintillating source (I Gwinnl 120011 ) and the distribution of noise 
(IGwinn et al.ll201lL |2012| ). Fitting a model involves calculating the model distribution for 
a given set of parameters, quantifying its difference from the observed distribution with a 
figure of merit, and then searching out the minimum such difference as a function of model 
parameters. We wish to ensure that our search is as broad as possible, so we begin with 
a grid search using the easily-calculated model for zero baseline length. We then use the 
best-fitting parameters from this search as initial conditions for a fit using a baseline of finite 
length. For both searches, we compare results for fits to a point-source model. We discuss 
the signature of finite size in the histograms. We present the results of fits for the size of the 
source, as a function of pulse gate and frequency. We discuss potential sources of error, and 
the statistical significance of the result. We compare these results with the inferred geometry 
of the pulsar's magnetic field. 



3.2. Distribution of Visibility 

3.2.1. Background 

In the absence of noise or amplitude variations, the interferometric visibility for a scin- 
tillating point source is the product of complex Gaussian random variables. The resulting 
distribution of visibility peaks sharply at the origin; indeed, the distribution is singular at 
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that point ( IGwinnll200ll ). Furthermore, the distribution has strong exponential wings that 



extend to large values. This large dynamic range complicates evaluation. 

For an extended source, also in the absence of noise or amplitude variations, the peak 
of the distribution softens and shifts toward greater real part, relative to that for a pointlike 
source. However, the variance decreases in both the real and imaginary directions. Thus, 
estimation of source size requires accurate calculation of the structure near the origin on 
small scales, and of the behavior of the wings at large values. Figure H] shows distributions 
for sources with zero size and with small size, on a short baseline, calculated using the results 
of this section; the parameters are comparable to those we find for the Vela pulsar in Section 
H] below. We discuss effects of noise in Section 13. 3^ and effects of amplitude variability in 
Section 13.41 



3.2.2. Visibility of a Pointlike, Scintillating Source 
Explicitly, the normalized distribution of visibility for a pointlik e, scintillating source 



without noise and with constant amplitude, is given by (lGwinnll200ll ): 



POO = -^Ao I ex P (2) 



7r K g (i - ^) "VO-pO <W "AO-pO ^ 

Here, p is the normalized covariance (that is, the normalized interferometric visibility of the 
source), and k is the scale. This distribution is that of a product, xy*, of correlated, complex 
Gaussian random variables. It describes the distribution of visibility, after averaging over 
an ensemble of electric field measurements; it does not include effects of background noise, 
self- noise, or intrinsic source variability. 



For a scintillating source viewed through an isotropic scattering screen, (iGwinnl 12001 
Equations 2, 4, 6) 

l(2n) 2 9 2 n b 2 
2 81n2~A 



P = ^\'-^ J &\- (3) 



Here, #h is the full width at half maximum of the scattering disk, b is the baseline length, 
and A is the observing wavelength. The mean correlated flux density of a pointlike source 
is, therefore, 

(V) = K p = K exp^(k9b) 2 Y (4) 

Here, the wavenumber is k = 2n/\, and the angular broadening is 9 = 9n/\/8 In 2. Angular 
brackets (...) denote an average over many scintillations. 
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For short baselines, p ~ 1, and the distribution of visibility is concentrated near the 
positive real axis; the imaginary part for a given value of Re[V] has variance that scales 
proportionately with Re[V]. For longer baselines, p — > 0, and the distribution is circularly 
symmetric about the origin. Figure 2 of iGwinnl (120011 ) shows examples. 

At zero baseline, p = 1, visibility V becomes intensity J, and P(V) becomes the well- 
known exponential distribution of intensity for a scintillating point source: 

1 



P(I) 



exp{-I/I }. 



(5) 



To connect the two distributions, note that a "zero-baseline interferometer" can, in principle, 
measure complex visibility, and will have complex noise; however, the visibility of the source 
is confined to the real axis, even with scintillation. Thus, Eq. is the projection of Eq. [2] 
onto the positive real axis, in the limit p — >■ 1. 



3.2.3. Visibility of an Extended, Scintillating Source 
The distribution of visibility for an extended scintillating source, wi thout noise a nd with 



constant amplitude, is the convolution of a number of copies of Eq. [2] (Gwinnl 1200 ll ). For a 
small, scintillating source, the distribution is the convolution of 3 such copies, two of them 
with scales k related to the size of the source along two orthogonal axes £, rj: 

k$ = K (kM9a^) 2 (6) 
K v = ^(kMOarj) 2 . 

Here, M = D/R is the effective magnification of the scattering disk, where D is the dis- 
tance from observer to scatterer, and R is the distance from scatterer to source. The di- 
mensions of the source are o~£, a v . We assume that the source is small in the sense that 
(kM8a^), (kM6a v ) << 1. If this condition does not hold, then additional terms are im- 
portant; the convolution involves more distributions. The covariances, corresponding to p, 
change for the subsidiary distributions as well: 

P5 = (l-(^) 2 )exp|-i(^|6|) 2 } (7) 
p v = {l-(b,kdf)e^{- l -{k6\b\? 



These are nearly equal to p for short baselines. 



We have not found a simple analytic expression for the result of the convolution for 
p < 1. Moreover, the convolution is challenging to reproduce numerically, because P(V) 
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has both a sharp peak at the origin, and high skirts that extend to large \V\. Our strategy 
is therefore to proceed as far as possible via analytic calculations, and then evaluate the 
remaining integrals numerically. 

To achieve our first numerical reduction, note that we can easily convolve visibilities 
that are drawn from identical distributions. The average of N such visibilities is distributed 
according to 

/ t /-\ 2 N N+1 1 {{VW"' 1 T , / 2N \V\\ ( 2Np RefV] 
Pn{V) = -j- T — —t- — K N -i - r — exp 



71 (N- 1)! (1 -p 2 )K 2 \ K J "~ L \1- P 2 K ) ^\l-p 2 K 

We can use this result to calculate the convolution of visibilities drawn from different distri- 



butions if we first use Feynman parameters (jSrednickil 120071 ) in Fourier space to symmetrize 



the corresponding conjugate product of functions. Because the visibility is complex, con- 
volution of iV visibilities requires a 2(iV — l)-dimensional integral. However, to symmetrize 
the convolution requires a single Feynman parameter for each visibility. The Feynman pa- 
rameters also have an overall ^-function constraint, so the convolution is reduced to an 
(N — l)-dimensional integral. 

For a small, circular source, the observed distribution is a convolution of three visibilities, 
two of which are drawn from identical distributions. These two distributions are parametrized 
by Equations [5] and [7| with = a n and b^kO « 1, b v k9 « 1. This symmetry allows 
elimination of an additional degree of freedom, so the original four- dimensional visibility 
integral is reduced to a one- dimensional integral. Explicitly, we assume that k , po, «i, and 
pi are arbitrary, but k 2 = K\ and p 2 = Pi- For convenience, we also introduce parameters 
di = Kj (1 — p^ /2 and Pi = 2 [tck 2 (1 — p 2 )] . Then, the distribution of visibility is given by 

P(V) = (P *Pi*P 2 )(V) (9) 

= 7r 2 PoP lalat\V\ 2 [ ds (1 - s)f 1 (s)K 2 (f 2 (s) \V\) e^ s ^ v \ 
Jo 

where we have defined, for convenience, the 3 functions 

/i(s) = [(a 2 s + a\(l - s)) (a 2 s + (1 - s) (a\ - s [a x p G - a Pi] 2 ))] 1 (10) 

his) = 
Ms) = 



\Ja 2 s + (1 - s) (of - s [aipo - a pi] 2 ) 

a^s + a\{l — s) 
a p s + aipi(l - s) 



a^s + a\ (1 — s) 

This one- dimensional integral is suitable for efficient numerical evaluation. 
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3.3. Noise 



Noise broadens the distribution of visibility described in the previous section. It softens 
the peak without shifting the centroid of the distribution. Noise arises as a background from 
unrelated sources and within the telescopes, and from the source itself ("self-noise"). In 
principle, background noise is independent of the behavior of the source, whereas self-noise 
increases with the flux density o f the source, and ha s different magnitude in and out of phase 
with the interferometric signal ( iGwinn et al.ll2012l ). 



The Dicke equation conveniently describes noise and self-noise. This equation states 
that error in measurements of antenna temp erature ST y aries with total system temperature 
T, including the contribution of the source (lDickdll946h : 



(STf 



N, 



obs 



where N ohs is the number of samples. This equation describes how accurately N ohs observed 
samples from a Gaussian distribution can measure the variance of the distribution (or, for 
interferometry, the covariance of two distributions). The net noise in interferometric visibility 
has variance that increases quadratically with the signal in phase with the signal; and linearly. 



with the sa me constant and linear coefficients, at quadrature with the signal ( IGwinn et al. 
201 li l2012h . 



The effect of noise on the distribution is not a convolution, because the noise depends 
on the visibility. Hence, it resembles a convolution, but with non-stationary kernel. Because 
we average over approximately 33 elements in frequency and time for each sample, after 
taking bandwidth, integration time, spectral resolution, and pulse gating into account, we 
assume that the noise follows a Gaussian distribution. For interferometric observations, a 
quadratic polynomial specifies the variance of noise in phase with the visibility, a 2 ; and the 
linear terms specify a\, the variance at quadrature: 



jj = ((SRe[V]) 2 ) 
I = ((Slm[V]) 2 ) 



bo + hdVD + hdV]) 2 



12) 



The parameters {60,61,62} describe the noise. For a source of constant flux density, 62 is 
simply the reciprocal of the number of samples, iV b s . For an source of varying amplitude, 
6 2 is l/iV bs plus (51 /I) 2 . Here SI is the standard deviation of the flux density on timescales 
longer than the accu mulation time 819 2/16 MHz = 512 fisec, but shorter than our 2-sec 
integration time (see 



Gwinn et al. 2011. Section 2.2.2 



Quantization of the signal, when it is digitized for recording, also affects noise. Quan- 
tization introduces additional background noise, and a gain that scales signal and noise 
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( lGwinnll2006l ). These parameters change with quantizer levels, in units of the v ariance of the 



electr ic field. Consequently, for this experiment they change with pulse gate (IGwinn et al. 



20121 ) . 



In this work, we fitted for the coefficients {&o,&i,&2} in Equation [T2l for each pulse 
gate and spectral range. We fit 60 and b\ freely, and fit for 62 with the requirement that 
b 2 > 1/iVobs = 1/33. The results of these fits are co mparable with the noise parameters 
obtained from differences of samples adjacent in time in lGwinn et all (120121 ). when the signal- 
to-noise ratio provides diff e rences with enough accuracy to measure the noise. The binning 
technique of IGwinn et all (120121 ) provides for easy visualization of the noise distribution 
and is independent of the underlying distribution of visibility, but is subject to biases and 
requires high signal-to-noise ratio. We adopt the global-fitting technique described above for 
this work, because it is more precise and is applicable for arbitrary signal-to-noise ratio. 



3.4. Amplitude Variations 



The pulsar changes amplitude with both time and frequency during the observations. 
For our observations, variations in shape and amplitude of individual pulses dominate time 
variability. Individual pulses vary in flux density, as well as in shap e and arrival time 
(jKrishnamohan fc Downslll983l ; iJohnston et al.ll200ll ; Kramer et al.ll2002l ). For the Vela pul- 
sar, these changes are almost uncorrelated between successive pulses. Variability changes 
with pulse phase: it is largest at the beginning and end of the pulse, and less during the 
pulse. 

Because the pulse is dispersed, higher-frequency channels sample a later pulse phase 
than lower-frequency. Thus, the average spectrum reflects the pulse profile, as shown in 
Figure [H Dispersion produces significant amplitude variations with frequency, even over a 
range of 1024 channels. These spectral variations reflect the average shape of the pulse, so 
they are stationary with time. Comparison of amplitudes averaged over a 1024-channel range 
of data, over longer timescales, show no further, slow effect of gain variations or variations 
of the source over the span of our data, as described in Section [2j 

Intrinsic amplitude variatio ns on timescales sho rter than the accumulation time do not 
affect the distribution of noise (IGwinn et al.ll201ll ). On timescales between the accumu- 
l ation time and th e integration time, amplitude variations contribute to noise through 6 2 
(IGwinn et all 120111 ). On timescales longer than the 2-sec integration time, intrinsic ampli- 
tude variation s lead to superposit ion of distributions, with different values for the amplitude 
parameter kq (IGwinn et al.ll2000l ). Our 2-sec integration averages over 22 or 23 pulses, re- 
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ducing the expected modulation by a factor of approximately 4.7. We parameterize the 
remaining amplitude variation by the intrinsic modulation index m s = a/ (I 2 )s2/ (-OS ~ 1> 
where the subscripted angular brackets (...} S 2 indicate an average over those intrinsic varia- 
tions with timescales longer than the integration time of 2 sec. 



3.5. Evaluation 

3.5.1. Projection onto the Real Axis 

We project the model distributions, including effects of noise and amplitude variations, 
onto the real axis. We calculate the projected probability density, and the summed squared 
imaginary part: 

pX k +w/2 poo 

V(X k ) = / dRe[V] / dlm[V] P(V) (13) 

JX k ~w/2 J~ 00 

pX k +w/2 poo 

Q(X k ) = / dRe[V] / dlm[V] lm[V] 2 P{V). 

JX k -w/2 J -00 

These are scaled models for the projections Vn, Qn of the data as described in Section |2.3.4[ 
and plotted in Figure [3j Note that Q(X k ), the mean square imaginary part in each bin, is a 
second moment of Im[V], and so is expected to be noisier than V(Xk), the zeroth moment 
of Im[V]. The higher noise for Q in Figure [3] reflects this. 



3.5.2. Noise and Amplitude Variations 

Effects of noise and amplitude variations must be calculated numerically for the full two- 
dimensional distribution, and then projected. Noise changes the value of each measurement, 
and thus spreads the corresponding probability distribution function over a surrounding 
region. It smoothes a spike, corresponding to a single deterministic value, into an elliptical 
Gaussian distribution centered at that point, with variances given by the noise polynomial, 
Equation [121 evaluated at that point. The effect of noise on the distribution of visibility 
thus resembles a convolution of the probability distribution of the scintillating source, with 
the Gaussian distribution of noise. It is not a convolution, because the noise depends on 
the signal: this operation is sometimes called a "convolution with varying kernel". The 
Gaussian noise, with dependence on signal strength, is the kernel in this case. Because 
the kernel varies, we must project after convolution. Because the distribution is relatively 
concentrated along the real axis, the notion of convolution after projection has intuitive 
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value, but we do not use this construction in our analysis for p < 1; we calculate the full 
variation of the noise kernel. 

The effect of amplitude variations on timescales longer than the integration time is to 
superpose distributions with different k but the same noise polynomial. We implement 
this effect numerically, by averaging a number of distributions with varying kq- Fortunately, 
integration over 2 sec reduces the intrinsic amplitude variations, as discussed below. 



3.5.3. Sample Evaluation 

A sample calculation illustrates our method, and shows the origin of the W-shaped 
signature of finite source size in the difference of projected distributions V. Figure [5] shows 
how the best-fitting zero-size and finite-size models differ, for the span of data shown in 
Figure HI The upper panel of the figure shows the projected distribution V without noise 
or amplitude variations. For zero size, the projected distribution shows a sharp cusp at 
Re[V] = 0, with an exponential decline along the positive real axis of V, and a more sharply- 
declining exponential along the negative real axis. The negative-side exponential declines so 
sharply because p = 0.986 ~ 1 in this example. For a model with finite size, with the same 
normalization and mean, the bulk of the projected distribution is shifted toward the positive 
real axis, with a rounded peak. For the same mean amplitude, the peak has a narrower 
spread. 

Note that the point-source model has greater probability at large and small Re[V] than 
does the extended-source model, but the extended model has greater probability in between. 
This behavior arises because we require that the two distributions have the same mean and 
normalization. The shift of th e maximum t oward greater Re[V] for the finite-size model then 



requires a reduction of k (see lGwinnll200ll . Equation 31). The largest exponential scale, k , 
dominates the behavior of the distribution away from the origin, so that a finite-size source 
concentrates the distribution of visibility. 

Noise broadens the distributions, as the middle panel in Figure [5] illustrates. The best- 
fitting noise parameters are {&o,fri,&2} = {0.000062,0.0037,0.078} for the zero-size model, 
and {0.000058, 0.0045, 0.181} for finite-size. The noise at V = has standard deviation v&o, 
or approximately 0.008. This value is comparable to the width of the distribution, so that 
the details apparent in the top panel of the figure are blurred. Moreover, the noise increases 
away from V = 0, as given by the higher-order coefficients. 

Intrinsic amplitude modulation also changes the distributions. The mean square of 
the intrinsic amplitude modulation is ml = 0.009 for the best-fitting zero-size model, and 
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rrig = 0.007 for the best-fitting finite-size model, for our example of spectral and pulse-gate 
range. In both cases the degree of modulation is small as expected after integrating 22 or 
23 pulses. 

The lower panel in Figure [5] shows the difference of the zero-size and finite-size models. 
Even after including effects of noise and variability, the underlying features of the two dis- 
tributions persist: the finite-size distribution has smaller probability density for Re[V] < 0, 
because of its lesser density near Re[V] = in the top panel; and at large Re[V], because 
of its more rapid decline there. It has greater density near the mean amplitude. Thus, we 
expect the signature of a finite-size source to be a W-shaped difference of V for the best 
finite-size model from the best zero-size model, after effects of noise and intrinsic amplitude 
variations have been included. 

The situation for the distribution of mean square imaginary part Q is somewhat similar 
to that for V, but the fractional difference between models is less. Indeed, nearly all of the 
spread in imaginary part results from noise, for values of p of interest for this baseline; so 
that the distribution Q functions more as a constraint on the noise model, than as a carrier 
of information about source size. We present plots for Q below, in Section 14.1.41 



3.6. Calculation of model 

3.6.1. Nested Iterative Integration 

Our calculation of the model distribution involves 4 nested iterative loops. We calculate 

V and Q in parallel, on a grid of points in the real part of visibility, X^. Each histogram bin 
has width w = 2 x 10~ 4 . This size is narrow enough to track the behavior of the distribution 
accurately, but wide enough to contain enough visibilities so that the Poisson noise does not 
obscure the model differences. 

At the lowest level, we integrate over s to find the probability density P(V) at a point 

V in the compl ex plane, using E quation [9j We integrate this expression iteratively using 



Simpson's rule (IPress et al.ll2007l ) with 2 x3 N segments of equal width on the N th iteration. 
The integration terminates when successive iterations have a fractional difference of less than 
0.1%, or iV > 10. Because the error bound for this integration scheme goes as N~ 4 , this 
criterion yields an expected accuracy of ~ 0.001% - a small fraction of the magnitude of 
measurable source size effects. 



We broaden the probability density at each grid point by its corresponding ellipti- 
cal Gaussian distribution of noise. The polynomial coefficients {6q ? ^2} (Equations [T2l 
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parametrize the noise, and the complex visibility at the grid point determines the scale and 
orientation of the ellipse. We use the analytic expressions for the projection of a Gaussian 
distribution and the projected second moment of a Gaussian distribution to evaluate the 
contribution of each point, with noise, to the projections V and Q on our grid of points. 

At the second level, we integrate the contributions to V and Q over the imaginary 
part of V. We integrate from the real axis out to the 4-sigma standard deviation for the 
distribution in the absence of size effects, as calculated from the second moment of Equation 
[2J Again, we use Simpson's rule iteratively with 2 x 3^ segments of equal width, but now 
require a fractional change of less than 10~ 8 in V{Xk) between two successive iterations. We 
monitor V(Xk) because it is more sensitive than Q(Xk) to behavior near the real axis, where 
the distribution varies most rapidly, so that V(Xk) converges more slowly. 

At the third level, we integrate the values of V and Q over sub-bins within each his- 
togram bin along the real axis. This step accounts for effects of finite histogram bin width 
in discrete representations of a probability distribution function. It is most important when 
there is large- amplitude non-linear structure, as exhibited by the rapid rise of P at the origin 
and the cusp of the noise-free distribution near the origin. In the limit of small bin width, w, 
the histogram representation of a function f(x) will systematically overestimate the function 
at xq by w 2 /"(xo)/24. We set the number of integrated sub-bins to 3; comparison with com- 
putations using 5 and 7 sub-bins for the fits that obtained smaller sizes yielded insignificant 
differences. 

At the fourth and highest level, we account for the intrinsic amplitude variability of our 
2-sec integrations. We calculate each visibility distribution 5 times, with different source 
amplitudes, as given by a Gaussian distribution centered at 1, scaled by an amplitude- 
variation parameter. We then superpose these 5 distributions to produce a distribution 
including effects of varia tions of ampl i tude of the source on the final distribution. The 



procedure is that used by iGwinn et al.l (120001 ) to describe the distribution of scintillation in 



the presence of amplitude variations. We found no difference when using more finely-divided 
distributions of source amplitude. Effects of amplitude variations would be significant if 
the amplitude approached zero; but because Vela does not null and variations of individual 
pulses are only weakly correlated, amplitude variations after 2-sec integrations are small. 



3.6.2. Exponential model 



For zero baseline, p — 1, the dist ribution of visibility is a sum of exponentials on the 
positive real axis and zero elsewhere (IGwinn et al.lll998l ). This form obviates the lowest 
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2 levels of the 4-fold nested loops above, so the model is faster to calculate and, thus, is 
useful for exploration of parameter space. For this exponential model, we assumed a circular 
source, as we do for our p < 1 calculation. Tests showed that the two calculations were 
equivalent for p — > 1: a useful check of our numerical integration. 

3.7. Fitting 

3.7.1. Fit Parameters 



Seven parameters describe the model. Three of these are the coefficients of the noise 
polynomial, Equation [T2l {&o> &i> &2}- A fourth and fifth describe the distribution in the 
absence of noise and amplitude variations: the amplitude scale Kq and the normalized mean 
interferometric visibility p. The scale Kq is proportional to flux density. The mean inter- 
ferometric visibility p describes the baseline length and angular size of the scattering disk 
(Equation [3]). We do not provide a parameter for overall normalization: we constrain the 
normalization of the model distribution to equal that of the observed distribution. A sixth 



parameter describes intrinsic amplitude variations 



(SI 2 / (I) ) on timescales longer 



than an integration time; because of such variations, the observed distribution involves a 
superposition of different values of Kq. The parameter (m 2 ) gives the variance of this distri- 
bution of amplitude fluctuations, normalized by the mean amplitude. A seventh parameter 
gives the size of the source, usually expressed as K rat io = K i/ K o = {kM6a) 2 . In principle, 
additional parameters give correlations /i, v for the subsidiary dis tributions due to source 
structure and the intrinsic elongation of the source (jGwinnl 1200 ll . Equation 28); however, 
these are expected to be nearly equal to p for a short baseline. 

Among the effects we did not include in our model are elongation of the scattering disk 
and elongation of the source. For a constant orientation of the baseline, elongation of the 
scattering disk ha s no effect on the distribution of visibility, for the appropriate parameters 
p, Ko, and K rat io (jGwinnl l200ll ). The rotation of our baseline was small during the test 
interval considered here. Effects of elongation of the source are more subtle, but appear 
in our simple model only when the source is nearly resolved by the scattering disk, or the 
baseline is relatively long. 

Note that p is a property of scatter ing, and shou ld be constant over the entire pulse. 
Finite source size will change it slightly (jGwinnl l200ll Equation 18). A source with spatial 
coherence, over scales comparable to those d etectable via scintil lation, can also change p by 
illuminating only part of the scattering disk ( jGwinn et al.lll998l ). 
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3.7.2. Summed, Weighted Squared Residuals 

Our fits minimize the weighted mean square difference between the data and the model. 
Individual bins in the histogram V measure a number of counts N, and are expected to 
display Poisson statistics. For Q the values are squares of drawn from a nearly Gaussian 
distribution, and should display statistics analogous to that of the Dicke equation, Equation 
ITT1 In both cases, the distribution of noise is nearly Gaussian, with variance proportional to 
the bin value. We adopt this weighting above a threshold of N = 100 counts, with constant 
weighting below that value so as to reduce the influence of bins with zero or few counts. 

The two histograms V and Q have different dimensions and different vertical scales. 
We seek to combine the residuals for both into one figure of merit. A reasonable conversion 
factor is simply the quotient of the integrated areas under the distributions, which has the 
correct dimensions. However, we expect the second moment to be noisier, as Figure [3] shows. 
To quantify this noise in Q, we fit the distributions with simple models to find smoothed 
average values, and then difference adjacent points to determine the noise. We find that the 
standard deviation of noise in Q determined in this way is typically 3 times that in V. We 
therefore weight the residuals in Q by 1/9 of the ratio of the areas under the two histograms. 
This has the effect of making the mean square residual roughly equal for the two, for our 
models. 

We find that the mean square residual is close to 1 for V with this weighting, for the fits 
described in Section H] below. This indicates that the statistics are indeed Poisson: number 
of counts limits the accuracy of the histogram values in our narrow bins. As expected for 
our scaled weighting, the mean square residual is close to 1 for Q, as well. 

As a further test, we also fit to V and Q independently. We find that fits to only V found 
minima close to those found using both V and Q, with equal or sometimes larger intrinsic size 
of the source. Fits to only Q typically do not converge; apparently this distribution does 
not contain enough information to determine model parameters independently. Fits with 
uniform rather than Poisson weighting yield similar results for size, but tend to converge less 
quickly. 

3.7.3. Grid Search Using Exponential Model 

In order to examine parameter space over large scales, and to provide initial parameters 
for our fits for arbitrary p, we performed a grid search using an exponential model, as 
described in Section 13.6.21 Because such a model can be calculated quickly, a grid search 
can survey large regions of parameter space efficiently. In our grid searches, we fit for 
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three parameters: two noise coefficients 60, &i and the amplitude k . We searched a grid 
of parameters in the noise coefficient b 2 and the size parameter ( kMOa) 2 . We dem anded 



that b 2 > 0.030, as dictated by the number of samples correlated (IGwinn et al.ll2012l ). and 
searched 0.030 < b 2 < 0.4. We ignored effects of longer-term intrinsic amplitude fluctuations 
between integrations: m s = 0. 

The grid search indicated that the summed, weighted squared residuals vary smoothly 
over the parameter space defined by the five varying parameters. We found only one min- 
imum in all IF, spectral and gate ranges. The best-fitting size of the source a defined by 
this minimum was usually significantly different from 0, with a reduction in the weighted 
residuals from the best-fitting zero-size model comparable to that found from the more so- 
phisticated fits discussed below. The size parameter {kM6a) 2 usually agreed to within 30% 
for the two kinds of fits. The amplitude-modulation parameter m s is responsible for much 
of the difference in size. The minimum found by the grid search provided a useful starting 
point for the much-slower fits with p < 1 and m s > 0, discussed in Section H] below. 



3.7.4. Levenberg-Marquardt Algorithm 



We use calculations of V and Q as described in Section 13.51 to fi t for model param eters 
with finite size and p < 1, using the Levenberg-Marquardt algorithm (IPress et al.ll2007| ). We 
fit for the 6 parameters described in Section 13.7.11 using weighting described in 13.7.21 We 
initialize these parameters using the results of the grid search with an exponential model. 



3. 7. 5. Normalized Visibility Parameter p 



We expect p = 0.986, based on the parameters reported by lGwinn et al.l (119971 ) : angular 
broadening of (3.3 x 2.2) mas (full width at half-maximum intensity), with the major axis at 
position angle 92°, in observations at wavelength A = 13 cm. We scale these to our observing 
frequency by 9 oc u 2 , and use the len gth and o rienta tion of our baseline as described in Section 
12.3. II to find p, using Equation 28 of iGwinnl ( 120011 ) . In tests, we found that the mean square 
residual for this data set was not particularly sensitive to p, for 0.85 < p < 1, so that our 
observations do not provide a good way to fit for p: the baseline is too short to provide much 
information. Our results for the fitted size, in particular, were not sensitive to p within this 
range: using p = 0.92 rather than p = 0.986 changed the inferred size parameter (kM6a) 
by less than 10%. In several frequency ranges where the pulse was strong (channels 1024 
through 4096 of Gate 2 of both IFs), fits including p strongly favored p > 0.9, with p = 0.92 
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sometimes favored within that range. We expect p to be almost the same for all of the data, 
in all channels and gates. Differences from changes in frequency were too small to detect. 
Effects of source size are expected to be small for our short baseline. Indeed, we expect 
from the m oments of the distribution that p and size parameter (kM8a) 2 should have little 
covariance ( 1 Johnson fc Gwinn II2013I ). We adopt p = 0.986, noting that revisions may cause 
small changes in the inferred size. 



4. RESULTS AND DISCUSSION 

4.1. Finite- and Zero-Size Fits 

We performed independent Levenberg-Marquardt fits for both finite and zero size sources. 
Comparison of the two provides a measure of the significance of our results. Both sets of fits 
were initialized with the best-fitting parameters for finite or zero size found from our grid 
search. Both allowed all the other parameters to vary: noise coefficients {&o, 61,^2}; ampli- 
tude scale Kq, and intrinsic amplitude variations on timescales longer than our integration 
time, m s . 



4-1.1. Sample Fit 

The lower panels of Figure [3] show the residuals to a fit with zero source size, in histogram 
form. Structure remains in the residual histogram, apparent as variations near zero visibility. 
An independent fit, to the same data but with a finite source size, removes most of these 
variations. To illustrate this, we display the difference of the finite-size model from the zero- 
size model, as smooth curves. The difference curve for V is the same as that shown in the 
lower panel of Figure [5j Like the residuals, this curve shows the W-shaped signature of finite 
source size. 

The mean squared residuals in V, after fitting for finite source size, are approximately 
those expected for Poisson-noise dominated errors. The data contain 437 bins with more 
than 100 counts. The sum of the squared Poisson-weighted residuals, after fitting, is 434. 
Thus, the mean square errors are approximately as expected. The fit to a model for a source 
with zero size leads to a sum of the squared Poisson-weighted residuals of 621, significantly 
greater. 

For <2, the sum of the squared residuals, after correction by the relative areas of V and 
Q, and the factor of 3 from comparison of differences, is 705 for a finite-size model, and 785 
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for a zero-size model. The residuals errors in Q appear noiselike in the plot, suggesting that 
the factor of 3 estimated from differences in other spectral and gate samples may be low for 
this sample of data. Nevertheless, the finite-size model improves the fit for Q. 

The reduction in summed mean-square weighted residuals for V and Q together is 
19%. This difference is highly statistically significant at more than the 40-cr level, for our 



samp le size and number of parameters, according to the F-ratio test (IBevington fc Robinson 



20031 ). At this high level, finite sampling limitations are unlikely to dominate errors in our 
estimates of emission size. However, this figure demonstrates that the signature of finite size 
appears in the data, as inspection of the figures suggests. The best-fitting size parameter 
is (kM9cr) 2 = 0.0423, or scaled size (kM9a) = 0.21. This corresponds to a source size of 
approximately 180 km (standard deviation of the Gaussian distribution), or to approximately 
420 km for the full width at half-maximum, as we discuss in Section 14.1.31 below. 

Standard errors are small, because of the high significance of the fits as expressed by the 
F-ratio test. We present the best-fitting parameters for this IF, gate and channel range in 
Table [TJ These values are typical for our fits. As we discuss in Sections 14.1.21 and 14.21 below, 
errors in the estimated size are probably dominated by systematic errors. 



4-1.2. Fits to All Spectral and Gate Ranges 

We fit our model to spectral ranges of 1024 channels in each of the 5 gates, for both 
IFs. Our fits indicate that the pulsar has a rather large size at the start of the pulse, de- 
creases in size over the first half of the pulse, and then increases in size again. Figure O 
shows the results of fits, giving the scaled size (kM0a) and mean amplitude (k + 2k\) as 
a function of pulse phase. Note that linear diameter is proportional to the square root of 
the fitted size parameter, (kM6a) 2 . All of the fits are independent; each point represents 
an independent sample of the pulsar's emission, fit completely independently using a priori 
parameters from independent grid searches (Section l3.7.3p . The two IFs are shown by differ- 
ent symbols (crosses for IF1 and circles for IF2), which represent distinct frequency ranges 
and thus completely different scintillation patterns. At some phases, points from different 
gates overlap; at these points, a higher frequency range in one gate coincides with a lower 
frequency range in the previous gate. Quite often the noise parameters for these different 
samples are quite different; nevertheless, size and amplitude track one another with pulse 
phase, despite the disparate origin of the samples. Some of the fits do sample identical 
samples of the diffraction pattern from interstellar scattering, as Figured suggests: however, 
these often lead to completely different results, indicating that the results arise from the 
pulsar, rather than from scattering. For example, data that includes the 3 panels shown 
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in Figure 1 yields scaled sizes of (kMQa) = 0.08, 0.09, and 0.40. (Of course, the grayscale 
in Figure |2] emphasizes peaks of visibility, whereas the most critical information on source 
size is found near zero visibility, as Figure shows.) The dependence of fitted size on pulse 
phase, rather than on the frequency or details of the scintillation pattern, suggests that the 
effect arises from the pulsar rather than the scintillation pattern. 

Our sample comprises 60 independent fits, over the two IFs, 5 gates, and 6 spectral 
ranges. A fit with finite size yielded significantly smaller residual for most of these: for the 
43 fits with fitted size parameters of {kM9a) 2 > 0.017, the reduction of the residuals was 
greater than 3%. This is significant at better than the 5-a level according to the F-ratio 
test. Unsurprisingly, gate and spectral ranges with small fitted sizes, and regions with small 
amplitudes at the beginning and end of the pulse, show the least significant results. We do 
not display statistical errors from the fits on this figure; they are smaller than the plotting 
symbols. 

The scatter of independent measurements at a given pulse phase provides a measure 
of the precision of our results. The residuals are dominated by variations among nearby 
bins, consistent with Poisson noise; whereas the models predict rather more slowly-varying 
differences across the histograms for V and Q, as Figure |5] would suggest. We made the bins 
narrow so as to follow the rapid variation of the histograms with Re[V], as shown in Figure 
|3j we cannot broaden the bins without destroying this important information by averaging. 
We discuss averaging of the post-fit residuals over bins to visualize effects of the model in 
Section 14.1.41 below. 

4-1.3. Conversion to km 

The angular width of the scattering disk 9 is important for the conversion of the fitted 
size parameter (kM9o~) 2 to o in km. The angular width appears directly in this expression, 
and also affects the inferred magnification M as we discuss below, so that the linear size of 
the emission region a depends strongly upon 6. Unfortunately, our data do not provide a 
good determination of p, as discussed in Section 13.7.51 above, and so do not allow accurate 
determination of 9. Consequently, we adopt a simple conversion based on previous work, 
while noting that more accurate measurements of 9 and M may revise the inferred conversion 
of (kM9a) to a, in kilometers. 



Gwinn et al.l (119971 ) found angular broadening of (3.3 x 2.2) mas (full width at half- 
maximum intensity), with the major axis at position angle 92°, in observations at wavelength 
A = 13 cm. We adopt this value here. For simplicity in converting the observed (kM9a) to 
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size ex, we assume a circular scattering disk with full width at half maximum of 9 = 5.2 mas; 
this is the mean square of the major and minor axes. For the scintillation bandwidth, 
we adopt the value of Av s = 15 kHz from Section 14.2.41 below. Wi th a distance to th e 
pulsar of D = 287 pc, as measured from the parallax of the pulsar ( IDodson et al.l 120031 ) . 
the combination of angular broadeni ng and scintillation bandwidth yields a magnification of 
M = D/R = 3.1 (IGwinn et al.lll993l ). Consequently, the standard deviation of the Gaussian 
model for the source is cr e = {kMOa) x 859 km. In this expression, the subscript 9 serves to 
indicate the dependence of the inferred source diameter on the angular size of the scattering 
disk, and the magnification. The full width at half-maximum diameter of the fitted model is 
V8 hx2aQ. The right-hand vertical axis on the lower panel in Figure [6]reflects these conversion 
factors. The size is as large as hundreds of km, expressed as standard deviation of a circular 
Gaussian distribution. 

Changes in the assumed parameters for angular broadening will modify the inferred 
emission size. The angular scale of the scattering disk 9 sets the scale of the scintillation 
pattern at the source, and thus the resolution of the lens; and 9 sets the location of the 
scattering material along the line of sight, and so the magnification factor M. If the value 
assumed for 9 changes, the inferred size a changes, even though the fitted parameter (kM9cx) 
remains unchanged. The net resulting effect is approximately proportional. Thus, halving 
the assumed angular broadening reduces the inferred size expresse d in ki lometers by a factor 
of 2, and so on. The previous measurement of 9 by IGwinn et al.l ( 119971 ) covered an incom- 
plete arc in the (u, v) plane, and depended on simpler approximations to the distribution 
of visibility for a scintillating source than the models used here; the measurement should 
be repeated. Refractive scintillation can change the angular broadening with time, but the 



varia t ion is expected to be only about 85 microarcseconds for this line of sight (IRomani et al. 



19861 ; lNarayanlll992l ). If scattering material is distributed along the line of sight rather than 
in a thin screen, and some of it is located close to the source, it can increase the magnifica- 
tion M without a change in the measured angular broadening. This would require scattering 
close to the pulsar, within the Vela supernova remnant, where density is expected to be low. 



4-1.4- Binned Residuals 

We visualize the significance of our fits, by binning our residuals. This reduces effects 
of small-scale noise while allowing us to present the signatures of source size in the data. 
Philosophically, re-binning the residuals is similar to subtracting a model from the data. For 
example, we subtract a model for source position, baseline length, and Earth orientation 
from the phases in the correlator, so that the "sky" fringe rate can be reduced enough 
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for time integration to recover the fringe with high signal-to-noise ratio. Fringing reveals 
the inaccuracies of the subtracted model. Here, analogously, we remove a model for the 
distribution of visibility from the histograms of data, reducing variations between neighboring 
bins from noise. We then average the residuals of nearby bins so that the effects of source 
structure are more easily apparent. This averaging reveals the degree to which our model 
fits match the observations. 

Figure [7] shows the result of binning residuals, again for the data in IF1, Gate 1, channels 
4096 to 5120. After fitting zero-size and finite-size models, the residuals were binned and 
averaged in groups of 20, for resulting widths of 20 w = 0.0040 in Re[V]. Averaging (rather 
than summing) keeps the normalization the same. The dotted histogram shows the residuals 
from the fit with zero size for the source. We also display the differences between the model 
for finite source size and for zero size as solid lines. The curves are the same as those 
shown in Figures and [3] and [5j A zero- size source would show a flat, dotted histogram. 
A model that perfectly corrected the deficiencies of the zero-size model would track the 
dotted histogram perfectly, allowing for the finite widths of the bins. Clearly, adding a size 
parameter explains most of the slowly-varying residuals. Quantitatively, after binning the 
mean square residual is reduced by a factor of 13, by including a size parameter. This is 
much larger than the reduction observed before binning. As Figure [3] suggests, Poisson noise 
dominates the residuals without binning, and is greatly reduced by binning. 

The model fit is imperfect. Some systematic variations remain, more so in some ranges 
of pulse phase than others. Figure [S] shows residuals for the zero-size model, and differences 
between finite-size and zero-size models, for representative spectral ranges in three gates. Fits 
to the same spectral range and same gate, but different IF, resemble one another strongly, as 
expected because they lead to nearly the same fitted size. Noise and amplitude parameters 
vary between IFs, and even more between gates and channel ranges, so that the binned 
residuals are not identical. 

The upper pair of panels of Figure [8] show IF2, gate 1, channels 4096 to 5120: these data 
are equivalent to those in Figure [7J in pulse phase and spectral range, but are for the other 
IF. The data are thus completely independent. Introduction of a size parameter reduces 
the mean square residual of V by 92%, and Q by 12%. The best-fitting size parameter is 
ki/kq — {kM8a) 2 = 0.043, or scaled size (kM8a) = 0.21. The IFs agree closely in residuals 
and in fitted size. This range lies on the rising part of the pulse profile, before the peak. 

The middle pair of panels show IF1, gate 2, channels 2048 to 3072. This range lies near 
the peak of the pulse. The best-fitting size parameter is k\/kq = (kM6a) 2 = 0.006, among 
the smallest results we obtain. Before binning, introduction of the size parameter reduced 
the Poisson-weighted residuals by less than 1%; this reduction is significant at the 2-a level 
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according to the F-test. After binning, the reduction in mean square residual for V is 32%; 
however, zero-size and finite-size models both fit Q almost equally well. Some systematic 
effects appear, particularly the sharp downward spike in V near Re[V] = 0. 

The lower pair of panels show IF2, gate 3, channels 4096 to 5120. This range lies on the 
trailing side of the pulse, near where it briefly flattens into a plateau. At shorter observing 
wavelength, this plateau becomes a second component that arises close to this pulse phase. 
Introduction of a size parameter decreases the residuals, by 10% in V and by 4% in Q 
before binning, and by 54% and 24% after binning. Again the influence of Q on the overall 
fit is small, because zero-size and finite-size models both fit Q well. The best-fitting size 
parameter is [kM6a) 2 = 0.039, or (kM9a) = 0.20. Although the fit to V is good, some 
systematic variation appears as a shifting of the histogram relative to the best-fitting model; 
our circular-Gaussian model does not accommodate such a shift. We are investigating a 
variety of simple models for the systematic variations, as we discuss briefly in Section 14.31 
below. 



4.2. Systematic Effects 

The effects of size that we observe in our data include its W-shaped signature in the 
distribution of visibility V, its variation with pulsar phase, and its constancy for different 
spectral ranges and samples of the scintillation pattern at the same pulse phase. The W- 
shaped signature of finite size on the distribution of visibility is quite characteristic, as 
Figures [3j [5j and [7] suggest, and appears for both IFs and many pulse gates and spectral 
ranges. The agreement of fitted size is good between IFs, for overlapping spectral ranges 
at different frequency but the same pulse phase, and between nearby but different spectral 
ranges. Effects that can match all of these, or even only the latter two, are nearly all 
associated with geometrical effects at the pulsar. 

4-2.1. Errors in Model Parameters 

Further significant changes in the magnification factor from changes in pulsar distance 
seem unlikely. The distance from parallax is far more reliable than that from dispersion 
measure. In principle a model that included significant scattering in the immediate neigh- 
borhood of the pulsar, as well as scattering in the surrounding Vela supernova remnant, 
could increase the effective magnification by bringing the lens closer to the pulsar. Tem- 
poral broadening at low frequencies suggests that scattering material is concentrated into 
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a thin screen ( IJohnson fc Gwinn 1 120131 ) . Previous measurements of the angular broadening 
9 should be revisited, as described in Section 14.1.31 above. Further studies of the angular 
broadening, using long baselines with different orientations, can help to improve on the 
earlier results. This may change the inferred linear size a. 



4-2.2. Instrumental Effects 



Instrumental effects are unlikely to reproduce the signature of source size on the dis- 
tribution of visibility, as shown in Figure because they tend to vary with pulse gate and 
frequency within the passband. For example, for both IFs, the noise parameter 6 shows a 
similar pattern for each pulse gate, reflecting variations in the gain and noise in the pass- 
band, with an ove rall offset for each gate that reflects changes in quantization noise (see 
Gwinn et alj|2012j , Section 5.1). Effects of quantization, in particular, can be expressed in 
the spectral domain as a gain and a change in noise, and so are absorbed into the amplitude 
parameter and &o- Effects of variation of noise parameters would be expected to appear as 
differences between overlapping gates. 

The direction of linear polarization varies smoothly across the pulse. We observe only 
left-circular polarization, the fraction of whi ch varies little. Scintillat i on in the inter stellar 
medium is nearly polarization- independent (IMinter fc Spanglerlll996l ; ISpanglerll200ll ). De- 
fects in separation of polarizations at the antennas could produce artifacts that vary with 
pulse phase, but these can be represented accurately as pulse-phase-dependent gains. They 
would not produce the distortion of the distribution of visibilities that we observe as the 
signature of source size. 

Saturation effects can be important at large antennas such as Tidbinbilla. For our 
ob servations, these might affect particularly strong pulses, such as the giant pulses reported 
by iJohnston et al.l (120011 ). However, such pulses carry an insignificant fraction of the total 
flux density of the pulsar. Saturation effects can also reduce visi bility for the yery s trongest 
scintillations, at least for baselines involving two large antennas (jGwinn et al.ll2000l . Section 
4.5). These effects are expected to distort the distribution of visibility at the largest values, 
but are not likely to reproduce the W-shaped signature of finite size we observe. 



4-2.3. Calibration and Fringing Effects 



We perform relatively little calibration on the data. We do not change amplitude, 
particularly within a scan or with frequency. We do change phase, in fringing. Mis-fringing 
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is possible for observations of a scintillating pulsar in the speckle li mit, because the p hase and 



amplitude vary on the frequency and time scales of scintillation ( iDesai et al.lll992[ ). so that 
the model used for fringing can be challenging to fit. Mis-fringing would tend to increase 
phase variations, while keeping amplitude the same. Such effects are reduced for a short 
baseline, such as the Mopra-Tidbinbilla baseline, because the phase variation is less. If the 
phase errors were very large, it could potentially alter the distribution of visibilities in our 
projections, by altering the relative sizes of real and imaginary part. We reduced errors from 
mis-fringing by fitting our fringe model over approximately 2 decorrelation times and 800 
scintillation bandwidths. The wide bandwidth reduces effects of scintillation-based phase 
variations. The short time was chosen to match any possible ionosphere- or atmosphere- 
induced phase variations. 

Effects of errors in the fringe model on size would be expected to reproduce among 
gates, since the fringe model from Gate 2 was applied to all gates (with the exception of 
a single phase across the band, fitted for each gate); however, size varies with pulse phase, 
rather than by channel range within gates, and is near zero for some spectral ranges near 
the pulse peak. For channel ranges with sufficient signal-to-noise ratio, such as the later 
channels of Gate 1 and most channels of Gate 3, we found that fringing to the data within 
that gate, rather than using the model from Gate 2, produced identical results. Similarly, 
applying the model from other strong gates to Gate 2 yielded nearly the same results. 



4-2.4- Time and Frequency Averaging 



Effects of the finite sampling of the diffraction pattern are small, and are expected 
to be constant across the pulse. Sampling over finite intervals of time and frequency blurs 
together corre lated samples of the diffr action pattern, and so is indistinguishable from effects 
of source size (I Johnson &: Gwinnll2012l ). For given scintillation ti mescale tyss and scin tillation 



bandwidth Au, effects of sampling can be calculated analytically (IGwinn et al.ll2000l Section 
2.2). 

We estimate the scintillation timescale and bandwidth from our observations on the 
Mopra-Tidbinbilla baseline, using data near the pulse peak, IF2, Gatel, channels 2048 to 
3072. We find correlation functions for the ob served real part of interferometric visibility, 
and fit expected functional forms to them (see IGwinn et al.l |2000| . Equations 5, 7). Figure 
[9] displays the data and results. We have normalized the correlation functions using the 
amplitude and offset of the best-fitting functional forms. The data point at zero lag is 
omitted, because it is elevated by effects of noise and variations of the amplitude with time. 
Both correlation functions extend to large lag, off the right of the displayed panels; these are 
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included in the fits, but not displayed here. 



A fit of the expected Gaussian function to the temporal correlation function yields a 
scintillation timescale of tjss = 8.99 ± 0.17 s. Our sampling tim e of 2 sec is then ex pected to 
decrease the mean square flux density by a factor of f t = 0.994 (IGwinn et al.ll2000l . Equation 
8), while leaving the average flux density unchanged. The variation of ft from the standard 
error of the fit is less than 0.001. 

A fit of the expected Lorentzian function to the frequency correlation function yields a 
scintillation bandwidth of Av = 15.2 ± 0.5 kHz. Our channel bandwidth of 1.9 5 kHz is then 
expec ted to decrease the mean square flux density by a factor of // = 0.998 (IGwinn et al. 



2000l . Equation 6), leaving the mean flux density unchanged. The observed correlation func- 
tion shows systematic departures from the best-fitting Lorentzian function, with a sharper 
peak and elevated correlation at intermediate lag, with a peak near a lag of 70 kHz. A Kol- 
mogorov rather than a square-law struct ure function, a nd a departure of the sca ttering disk 
from isotropy, can produce such effects (lGwinnll200ll ; iJohnson fc Gwinnl 120121 ) . Temporal 
yariation of the emission on short timescales will also introduce correlations in frequency 
(IGwinn fc Johnson! 1201 ll ). Models including such effects will fit better. However, as the 
figure shows, the correlation falls to half its peak value at a lag of approximately 14 kHz, 
providing a clear characteristic scale. The variation of // from the standard error of tiss is 
less than 0.001. Even with Av = 12 kHz we still find f f = 0.998. 

Together, the effects of finite sampling in time and in frequency will increase the fitted 
size. Under the assumption that the source is pointlike, so that (kM6a) = 0, and we 
misinterpret the effects of averaging in time and frequency as source size, our sampling 
parameters lead to a fictitious size parameter of (kM8a) = 0.06, where we have related 
effects of fini t e sam pling to size using the expressio ns for modulat ion index from averaging 
Gwinn et al.l ( 120001 . Equation 9) and for source size iGwinnl ( 1200 ll . Equations 34, 35). This 
value is indeed nearly the minimum that we observe, near the pulse peak, as Figure |6] shows. 

The effects of finite sampling in time and frequency are constant with pulse phase, for 
a pointlike source. This is clearly inconsistent with our results discussed in Section H] above. 
Variation of amplitude with time can affect estimates of the scales of scintillation: variability 
on long timescales can aff ect the correlation with t i me, and on short ti mescales can affect the 
correlation in frequency (I Gwinn fc Johnson! l2011t IGwinn et al.l 120111 ) . Erroneous estimates 
will not produce variations of source size with pulse phase. Direct effects of source variability 
on estimates of source size are discussed elsewhere in this paper (Sections I3.4| I3.5.2p . 
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4.3. Alternative Models 

Our fit assumed a circular-Gaussian distribution of emission at the source. More pre- 
cisely, since the scattering disk is presumably elongated, we have fit the data with a model 
for a source with the same axial ratio as the scattering disk. This description simplifies the 
mathematics. Some of the residuals in Figure [H] show systematic residuals that suggest that 
a more complicated model might provide a better fit. A source elongated along a single di- 
rection provides a model arguably as simple as the one we use; the size inferred for such a fit 
is approximately v2 greater than the diameter for a circular Gaussian. A source of arbitrary 
axial ratio requires a more complicated model; we will discuss such models in conjunction 
with data on longer baselines. Of course, an infinite set of models can be fit to any finite 
data set; we choose the circular-Gaussian case as providing a simple parameterization that 
agrees well with our data. 

Another simple model is a "core-halo" model, including a pointlike core superposed 
with an extended halo. In the simplest case, this model would include a pointlike core 
and a halo so extended that it did not scintillate at all. Such a model would produce the 
distribution of visibility for a scintillating point source, as shown for example in the left 
panel of Figure HJ offset to the right, by convolution with the delta-function distribution 
of visibility expected for a non-scintillating source. To match our observations, the two 
components of this distribution would have to change their relative magnitudes over the 
course of the pulse, and to both disappear when the pulsar is "off": we see neither in the 
empty Gate 6. Moreover, the size of the "halo" would have to be large: much greater than 800 
km. We regard this model as more complicated, and less natural, than a circular-Gaussian 
model. Such a model appears to fit less well than a circular-Gaussian model at either side 
of the pulse maximum, where size and amplitude are both relatively large, although it fits 
significantly better than a zero-size model at some ranges of pulse phase. 



4.4. Comparison with A = 13 cm results 



We reported a size of the Vela pulsar's e mission region of approxim ately 300 km, at 
13 cm observing wavelength, in previous work ( Gwinn et al.lll997l . |2000| ). The three gates 
used there correspond roughly to ranges of pulse phase of —1.2 to 0.2 msec, 0.2 to 1.2 msec, 
and 1.2 to 3.7 msec, in Figure [6j We reported size parameters of (kM6a) 2 of 0.091 ± 0.009, 
0.070 ± 0.009, and 0.020 ± 0.020 in three gates across the pulse, at A = 13 cm observing 
wavelength. (Note: The exponent " 2" was omitte d in th e legend for these quantities in the 
first line of the body of Table 4 of iGwinn et al.l ( 120001 )). For comparison with the lower 
panel of Figure |6l the square roots of these measured quantities should be multiplied by the 
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ratio of wavelengths of the observations, 18/13. The resulting sizes are comparable to those 
inferred here, except for the third gate, for which the size at 13 cm wavelength is significantly 
smaller. The 13-cm gate extends well past the end of the pulse at 18 cm; the pulse is wider 
at the shorter wavelength, and flux density in the gate is significantly higher, because of the 
emergence of a second peak. This profile difference might be responsible for the emission 
size discrepancies. 

The observations and analysis at 13-cm wavelength differed in detail from those pre- 
sented here. One difference was that we assumed a circular source, rather than one with 
the same aspect ratio as the scattering disk; this would tend to increase the size parameter 
(kM8a) 2 estimated at 13 cm wavelength. We also used an exponential model; this is a rea- 
sonable approximation for the shorter baseline at shorter wavelength. The rey ised distance 



to the pulsar from parallax measurement, of D — 287 pc (IDodson et al.ll2003l ) rather than 
500 pc, changes the magnification factor M from 1.5 to 3.1 and so decreases the estimated 
size of the pulsar. The integration time was longer and the spectral-channel bandwidth was 
higher, as allowed by the increased decorrelation scales of the scintillation. However, the 
channel bandw idth was great en ough that the effects of averaging in frequency were non- 



negligible (see iGwinn et al.ll2000l Section 2.2). There were also fewer scintillation elements 
sampled. The model was fit to amplitude rather than to interferometric visibility; conse- 
quently, the effect of size on the distribution function is less simple. The model for self-noise 
omitted the quadratic term, although this omission was justified because the effects of finite 
source size appear at small amplitude, where the quadratic term is small. A 2-dimensional 
distribution of noise as given by bo and bi is critical for an accurate fit to the distribution of 
amplitude, however. 



4.5. Comparison with Pulsar Geometry 



The pulse profiles of most puls ars can be divided i nto "core" and "cone" components, 
after the inferr ed shape of the beam jRankinlll983lll990h . Vela is usually classified as a "core 
single" pulsar (IRankinl I19931 ). as might be expected from its young age and short period; 
its narrow pulse, with width nearly constant with observing wavelength; and its strong 
polarization at meter wavelengths. A second component emerges at obs erving wavelengt h 



A ~ 10 cm, although it is not clear that this represents a subsidiary peak (IKern et al. 



Vela fits the scaling relation of pulse width with period for conal pulsars well ( jRankin 



2000). 



1993|). 



This scaling matches the opening angle for a dipole field near the star's surface, suggesting 
that core-single emission arises from near the star's surface. The radio beam is not well 
aligned with the magnetic axes suggested by high-energy emission, as is not uncommon for 
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single-core components, raising the possibility of other geometry for this type of emission. 

Unlike most core-single pulsars, Vela shows an organized pattern of polarization; indeed, 
it was the archetyp e of the rotating-vector mo d el, which provides the fundamental pattern 
for conal emission (IRadhakrishnan et al.l Il969l ; iKomesaroffl 1 19701). The an gle between the 
magnetic axis and the line of sight is 6° in this model ( iJohnston et al.ll200ll ); if one assumes 
that emission arises where the last open magnetic field lines are tangent to the line of sight, 
and that the rotation axis lies in the plane of the sky, the altitude of emission is 200 km. 
The inferred altitude is significantly greater if the rotation axis is not in the plane of the sky. 



Krishnamohan fc Downs! (119831 ) analyzed the polarization properties of the Vela pulsar, 
in bins in amplitude of individual pulses. They used the mapping of polarization onto dipole 
field geometry, and the difference in arrival time, to infer the location of four distinct pulse 
components. They found that earlier emission components arise from higher above the 
neutron-star surface than later components, with a spread in altitudes of 500 km. 

In contrast, our measurements are sensitive to the lateral extent of the emission region, 
rather than its altitude. These two are related by the height of the emission region and, 
plausibly, the geometry of the magnetic field. Detailed discussion is beyond the scope of 
this paper, but we note that the inferred lateral size s tend to suggest emission altitudes 



at least as great as those inferred from field geometry ( IDyks et al.l 120041 : iGangadharal 12005 
Johnson et al.ll2012l and references therein). 



The observed size might result from refraction or scattering in, or near, the pulsar's 
emission region. The last closed field lines provide zones where plasma waves might grow, and 
then then propagate along the field until converted to electromagnetic waves (IBarnard fc Arons 
19861 ). Such an emission process might preserve much of the geometry and tempora l varia- 
tions of the emission, while translating it to a higher altitude ( jHirano fc Gwinnll200ll ). Inter- 
estingly, polarization appears to be perpendicular to the curvatu re of the magne tic field lines 
for the Vela pulsar, which suggests reprocessing after emission (ILai et al.l 120011 ) . Individual 
samples of the electric field of the pul sar show log- normal statistics, suggestive of multiple, 
random contributions to amplification (jCairns. Johnston. &: Dasll20031). Radiation emitted a t 



low altitudes may also be refracted or scattered at higher altitude ( ILyutikov fc Parikh 



Reprocessing might include interaction with X-ray and gamma-ray emission (IHarding et al. 



200 



20081 ; iPetroval 120091 ) . Thus, a variety of evidence suggests that the site of the initial emis- 
sion may be distinct from the location at which the radiation is released to propagate, with 
scattering in the interstellar medium, to the Earth. 
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5. SUMMARY 

We have described analysis of the distribution of visibility of the Vela pulsar at A = 18 
cm wavelength. We observed the pulsar in two IFs of 16 MHz bandwidth each. We observed 
the pulsar in 5 gates across the pulse, each 1 msec wide. The signal was not dedispersed, 
so higher frequencies sampled later pulse phase than lower frequencies. We fringed the data 
and from it formed distributions of visibility projected into bins along the real axis V (Re[V]) 
and the second moment of imaginary part in each bin Q (Re[V]). 

We calculated a theoretical model for the distribution of visib ility. This mo del used the 



distribution of interferometric visibility for a scintillating source (IGwinnl 120011 ) . We calcu- 
lated models for a source of zero size, and for a source of small but nonzero size. We assumed 
a Gaussian distribution of flux density with the same axial ratio as the scattering disk. We 
then added the effects of noise and self-noise using convolution with a non-stationary kernel. 
We modeled noise with a second-degree polynomial in phase with source visibility, and the 
linear terms of that polyno mial at quadrature , as predicted by theoretical models for noise 



and exhibited by our data ( jGwinn et al.l 120121 ). We also included a parameter for effects of 



intrinsic amplitude variability of the source, on timescales longer than the 2-sec integration 
time. We find that effects of size appear as a W-shaped difference of the best finite-size model 
from the best zero-size model, as a consequence of the shape of the underlying distributions 
of visibility. 

We fit our models to the observed distributions using the Levenberg-Marquardt method. 
We weighted residuals by Poisson statistics for V and by statistics appropriate for mean 
square elements of a Gaussian distribution for Q, with constant weight below a cutoff of 
iV < 100 samples in a bin. We searched parameter space using a grid search with the 
exponential p = 1 form for the visibility distribution, and found that the variations of the 
summed, squared weighted residuals was quite simple. We used the minimum parameters 
from this search as initial parameters for Levenberg-Marquardt iterative fits. 

Residuals to our fits for zero size show the characteristic W-shaped signature of finite 
size. Independent fits for finite size describe the data well, matching the data and presenting 
the expected form when differenced with the zero-size model. We find that the resulting 
inferred, intrinsic size of the pulsar emission region first decreases across the pulse, then in- 
creases. The maximum size is approximately 800 km (FWHM of our Gaussian distribution), 
and the minimum is near zero. The reduction in residuals ranges up to 10%. The statistical 
significance of including the size parameter is high, 40-<r or more in some cases. Residuals 
are dominated by Poisson noise, from the finite number of samples in each bin. 

To help visualize effects of the fits, we binned the residuals after fitting. Binning reduces 
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Poisson noise, and removes the relatively rapid variation of the distribution, leaving the 
slower variations from source size. The results show that the distribution of visibility matches 
the W-shaped signature of finite source size well, as we observe in a test case even before 
binning. After removing most Poisson noise by binning, introduction of the fitted size 
parameter reduces residuals by as little at 30% and as much as 92%. Some gates show 
evidence for residual systematic differences of the distribution from the model. We consider 
various systematic effects that could modify our results, and compare results with previous 
observations at A = 13 cm observing wavelength. We briefly compare our results with 
previous work on geometry of pulsar emission. 
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1 2 3 4 5 

Pulse Phase (gate) 



Fig. 1. — Pulse amplitude as a function of pulse gate and spectral channel. Plotted is 
the average real part of the visibility for IF2, all gates and channels, from 19:10 to 21:13 
UT, boxcar-averaged over 20 spectral channels. Gates and spectral ranges are indicated 
by alternating circles (odd) and crosses (even). Because of dispersion, spectral channel 
corresponds to pulse phase within a given gate; successive gates are offset by the gate width 
of 1 ms. Scale shows centers of channel ranges in multiples of 1024 for Gate 2. The different 
gains, from changes in electric-field variance with fixed quantizer thresholds, broaden the 
curve where gates overlap. 
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Fig. 2. — Dynamic spectrum, showing the real part of the visibility for a short frequency 
and time interval in 3 gates for IF1. Amplitudes of the grayscales were equalized using the 
average real part in this spectral range, as shown in Figure [Q 
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Fig. 3. — Left panels: Observed distributions of visibility V for IF1, gate 1, channels 4096 to 
5120. Upper: Distribution. Lower: Residuals to best-fitting model with zero size for pulsar. 
Curve shows best-fitting model with finite size. Right panels: Observed distributions of 
mean squared imaginary part Q for the same data. Upper: Distribution. Lower: Residuals 
to best-fitting zero-size model, and curve for difference of finite-size model from zero-size 
model. 
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Fig. 4. — Model distributions of visibility of a scintillating source in the absence of noise, for 
a pointlike source (left) and for a source of finite size (right). Distributions assume amplitude 
K = 0.006 and correlation p = 0.986. 
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Fig. 5. — Comparison of model distributions V for IF1, gate 1, channels 4096-5120. Upper 
panel: Projections for model without noise. Dotted curve shows projection for pointlike 
source (kM0a = 0), solid curve for finite-size source [kMQo = 0.21). Middle: Projections 
including noise. Addition of the effects of noise makes the model curves nearly indistinguish- 
able. Lower: Difference of model curve for finite size from zero size, including noise. The 
difference shows that the model with finite size has lower probability at negative Re[V] and 
larger positive Re[V], but larger probability at intermediate Re[V]. 
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Fig. 6. — Best-fitting amplitude k + 2«i (top panel) and normalized source size (kMOa) 
(lower panel) plotted with pulse gate, for 4 gates in 6 spectral ranges. The model for the 
emission region assumes a circular Gaussian distribution of emission. Crosses show IF1, 
circles show IF2. Right-hand axis on lower panel shows full width at half maximum of size 
of emission region in km, estimated as described in Section 14.1.31 
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Fig. 7. — Plots of the binned differences of the data from the best-fit zero-size model (dotted 
histogram), and of the best-fitting finite-size model from that zero-size model (solid curve). 
Agreement of the solid curve with histogram shows that finite-size model can explain many 
of the binned residuals. Data are for IF1, gate 1, channels 40965120, as shown in Figure 3. 
The curves are identical to those shown in the lower panels of Figure 3, and the curve in the 
right panel is identical to that shown in the lower panel of Figure [5j 
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Fig. 8. — Binned residuals in 3 gates for different pulsar gates and spectral ranges from that 
shown in Figure [7J Left panels show residual 5V, right panels show SQ. Legends in panels 
indicate IF, gate, and channel ranges. The top panels show the same channel and gate range 
as Figure UJ but in the other IF. 
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Fig. 9. — Correlation function of real part of visibility in time (left panel) and in frequency 
(right panel), with best-fitting theoretical forms for the short Mopra-Tidbinbilla baseline, 
near the peak of the pulse at IF2, Gate 2, channels 2048 to 3072. Correlation functions 
extend to large lags, not shown but included in the fits. 
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Table I. Best-Fit Parameters: IF 1, Gate 1, Channels 4096-5120 



Parameter 


Fitted Size 


Zero Size 


Type 


Symbol 


Value (Std. Error) W 


Value (Std. Error) W 


Noise 




0.0000578(4) 


0.0000618(6) 






0.00450(8) 


0.00381(17) 






0.170(17) 


0.0662(19) 


Variability*^ 


K) 


0.07705(4) 


0.1045(18) 


Scale 




0.00566(2) 


0.00612(4) 


Size Parameter 


(kM9a) 2 


0.0423(2) 


= 



•^Standard error shown in parentheses for last digits of quoted value. 

•^Includes effects of intrinsic variability on timescales between accumulation 
time of 512 //sec and integration time of 2 sec. 

•^Includes effects of intrinsic variability on timescales longer than integration 
time of 2 sec. 



